Amendments to the Specification: 

At page 7, lines 7-29, please substitute the following paragraph for the existing 

text: 

In preferred embodiments of this method more than one SAGE tags are 
simulateously identified. The multiple identification provides fr high-throughput. The high- 
throughput generation of longer SAGE tags for gene identification (GLGI) prodecure has 
several important features, for example, (i) 3 f cDNAs instead of full-length cDNAs are used as 
the templates for GLGI amplification. This prevents artificial amplification from non-specific 
annealing of snese primer. The 3' cDNAs can be amplified to provide sufficient templates for 
GLGI amplification; (ii) a single antisense primer (in one example the primer is: 5- 
ACTATCT AGAGCGGCCGCTT-3 ' (SEP ID NO: \2\ (see also Example 3)) is used for all 
GLGI reactions instead of using combination of the five anchored oligo dT primers. The 
sequence of the antisense primer is located in 3' end of all the cDNA templates incorporated 
from anchored oligo dT primers used for the first strand cDNA synthesis. Use of a single primer 
also increases the efficiency of GLGI amplification significantly as any annealing of this primer 
with 3' end sequence results in extension during PCR. This feature is particularly useful to 
amplify the templates with low copies; (iii) Use of PLATINUM Taq polymerase instead of Pfu 
DNA polymerase increases the yield of final products, while maintaining high specificity; (iv) 
the GLGI amplified DNAs are directly precipitated and cloned into vector without gel 
purification, which further prevents loss of amplified products. The inventors contemplate that 
this is especially important for products with short sizes and for products generated from 
templates with low copies. Thus, the methods of this invention provide the ability for large- 
scale identification of expressed genes. Genes of any eukaryotic origin, including human genes 
may therefore be identified at an accelerated rate by the simple, efficient and low-cost methods 
set forth herein. 



At page 20 through page 24, please replace Table 1 with the following substitute 



table: 



Table 1 : Restriction Enzymes 



Enzyme Name 


Recognition Sequence 




Aatll 


GACGTC 




Acco5 1 


GGTACC 




A ~ ~ T 

Acc 1 


GTMKAC 




A — I T 

Aci 1 


CCGC 




A «1 T 

Acl 1 


AACGTT 




Ate 1 


AGCGCT 




A -CI TT 
All 11 


CTTAAG 




A -fl TTT 
All 111 


ACRYGT 




Age I 


ACUCjCjI 




A V+A T 

And 1 


GACNNNNNGTC 


(SEO ID NO: 36) 


A 1,, T 

Alu 1 


AGCT 


A 1«» T 

Alw 1 


GGATC 




A 1„,\T T 

AlwN 1 


CAGNNNCTG 




A T 

Apa 1 


GGGCCC 




A ««T T 

Apa.L 1 


GTGCAC 




A « rt T 

ApO 1 


n A A r T"T" , "\7' 

RAATTY 




A ~ ~ T 

Asc I 


GGCGCGCC 




A T 

Ase I 


A mm a a m 

ATTAAT 




Ava 1 


CYCGRG 




A TT 

Ava 11 


GGWCC 




A - ™ TT 

Avr 11 


CCTAGG 




Bae 1 


NACNNNNGTAPyCN 


(SEO ID NO: 37) 


Bamti 1 


GGATCC 




Ban 1 


GGYRCC 




T) TT 

Ban 11 


GRGCYC 




Bbs 1 


GAAGAC 




Bbv I 


GCAGC 




BbvC I 


CCTCAGC 




Beg I 


CGANNNNNNTGC 


(SEO ID NO: 38) 


BciVI 


GTATCC 


Bell 


TGATCA 




Bfal 


CTAG 




Bgll 


GCCNNNNNGGC 


(SEO ID NO: 39) 


Bgl II 


AGATCT 




BlpI 


GCTNAGC 




Bmrl 


ACTGGG 




Bpm I 


CTGGAG 




BsaA I 


YACGTR 




BsaB I 


GATNNNNATC 


(SEO ID NO: 40) 


BsaHI 


GRCGYC 
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Bsal 


GGTCTC 


BsaJ I 


CCNNGG 


BsaWI 


WCCGGW 


BseRI 


GAGGAG 


Bsgl 


GTGCAG 


BsiEI 


CGRYCG 


BsiHKA I 


GWGCWC 


BsiWI 


CGTACG 


BslI 


CCNNNNNNNGG 


BsmA I 


GTCTC 


BsmB I 


CGTCTC 


BsmFI 


GGGAC 


BsmI 


GAATGC 


BsoB I 


CYCGRG 


Bspl286 I 


GDGCHC 


BspDI 


ATCGAT 


BspE I 


TCCGGA 


BspHI 


TCATGA 


BspMI 


ACCTGC 


BsrB I 


CCGCTC 


BsrDI 


GCAATG 


BsrFI 


RCCGGY 


BsrGI 


TGTACA 


Bsrl 


ACTGG 


BssH II 


GCGCGC 


BssKI 


CCNGG 


Bst4C I 


ACNGT 


BssS I 


CACGAG 


BstAPI 


GCANNNNNTGC 


BstB I 


TTCGAA 


BstE II 


GGTNACC 


BstF5 I 


GGATGNN 


BstNI 


CCWGG 


BstUI 


CGCG 


BstXI 


CCAJSTNNNNNTGG 


BstYI 


RGATCY 


BstZ17 I 


GTATAC 


Bsu36 I 


CCTNAGG 


Btgl 


CCPuPyGG 


Btrl 


CACGTG 


Cac8 I 


GCNNGC 


Clal 


ATCGAT 


Ddel 


CTNAG 


Dpnl 


GATC 


DpnII 


GATC 


Dral 


TTTAAA 


Dra III 


CACNNNGTG 


DrdI 


GACNNNNNNGTC 



(SEP ID NO: 41 ^ 



(SEP ID NO: 421 



(SEP ID NP: 43) 



(SEP ID NP: 44) 
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Eae I YGGCCR 

Eag I CGGCCG 

Ear I CTCTTC 

Eci I GGCGGA 

EcoN I CCTNNNNNAGG (SEP ID NO: 45) 

EcoO 1 09 I RGGNCCY 

EcoR I GAATTC 

EcoR V GATATC 

Faul CCCGCNNNN 

Fnu4H I GCNGC 

Fok I GGATG 

Fse I GGCCGGCC 

Fsp I TGCGCA 

Hae II RGCGCY 

Hae III GGCC 

Hga I GACGC 

Hha I GCGC 

Hinc II GTYRAC 

Hind III AAGCTT 

Hinfl GANTC 

HinPl I GCGC 

Hpa I GTTAAC 

Hpa II CCGG 

Hph I GGTGA 

Kas I GGCGCC 

Kpn I GGTACC 

Mbo I GATC 

Mbo II GAAGA 

Mfe I CAATTG 

Mlu I ACGCGT 

Mly I GAGTCNNNNN (SEP ID NO: 46) 

Mill I CCTC 

Msc I TGGCCA 

Mse I TTAA 

MslI CAYNNNNRTG (SEP ID NO: 47) 

MspAl I CMGCKG 

Msp I CCGG 

Mwo I GCNNNNNNNGC (SEP ID NO: 48) 

Nae I GCCGGC 

Nar I GGCGCC 

Nci I CCSGG 

Nco I CCATGG 

Nde I CATATG 

NgoMI V GCCGGC 

Nhe I GCTAGC 

Nla III CATG 

Nla IV GGNNCC 

Not I GCGGCCGC 
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Nrul 


TCGCGA 


Nsi I 


ATGCAT 


Nspl 


, RCATGY 


Pac I 


TTAATTAA 


PaeR7 I 


CTCGAG 


Pcil 


ACATGT 


PflFI 


GACNNNGTC 


PflMI 


CCANNNNNTGG (SEO ID NO: 49) 


Plel 


GAGTC 


Pme I 


GTTTAAAC 


Pmll 


CACGTG 


PpuMI 


RGGWCCY 


PshAI 


GACNNNNGTC (SEO ID NO: 50) 


Psi I 


TTATAA 


PspGI 


CCWGG 


PspOM I 


GGGCCC 


PstI 


CTGCAG 


Pvu I 


CGATCG 


Pvu n 


CAGCTG 


Rsa I 


GTAC 


RsrII 


CGGWCCG 


Sac I 


GAGCTC 


Sac II 


CCGCGG 


Sail 


GTCGAC 


Sap I 


GCTCTTC 


Sau3A I 


GATC 


Sau96 I 


GGNCC 


Sbfl 


CCTGCAGG 


Seal 


AGTACT 


ScrFI 


CCNGG 


SexA I 


ACCWGGT 


SfaNI 


GCATC 


Sfcl 


CTRYAG 


Sfil 


GGCCNNNNNGGCC ( SEO ID NO: 51) 


Sfol 


GGCGCC 


SgrAI 


CRCCGGYG 


Smal 


CCCGGG 


Smll 


CTYRAG 


SnaB I 


TACGTA 


Spel 


ACTAGT 


SphI 


GCATGC 


Sspl 


AATATT 


StuI 


AGGCCT 


Sty I 


CCWWGG 


Swal 


ATTTAAAT 


TaqI 


TCGA 


Tfil 


GAWTC 


Tlil 


CTCGAG 
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Tse I GCWGC 

Tsp45 1 GTSAC 

Tsp509 1 AATT 

TspR I CAGTG 

Tthlll I GACNNNGTC 

Xba I TCTAGA 

Xcml CCANNNNNNNNNTGG (SEP ID NO: 52) 

Xho I CTCGAG 

Xma I CCCGGG 

Xmn I GAANNNNTTC (SEP ID NO: 53) 



At page 41, lines 14-20, please substitute the following paragraph for the existing 

text: 

SAGE Tags. A group of SAGE tags 10 bases long were selected from the SAGE 
tag sequences database generated from epithelium cells of normal colon (Zhang et al., 1997) 
(http://ww r \v.ncbi.nlm.nih.gov/SAGE/sag e r e c.cgi?r e c~166) . Each selected SAGE tag sequence 
was searched in the UniGene database (http ://w\w.ncbi.nlm.nih.gov/S AGE/S AGEtag.cgi?tag) 
to identify it as a matched or an unmatched tag sequence. Each matched sequence was given the 
appropriate Unigene ID number. Both matched and unmatched tags were used in the 
experiments. 

At page 42, lines 16-1 7, please substitute the following paragraph for the existing 

text: 

Database search. All the sequences generated from the clones were searched 
using the BLAST program for alignment (http ://www.ncbi.nlm.nih.gov/BLAST/) . 

At page 51, lines 1-6, , please substitute the following recitation for the existing 

text: 

All collected sequences were matched to GenBank Database (NR and ESTs, 
httpV/www.ncbi.nlm.nih.gov/BLAST -A through BLAST. Any mismatch between the SAGE tag 
sequence used for GLGI amplification and the SAGE tag sequence of the matched sequence in 
database was considered as non-specific amplification, and these sequences were eliminated 
from further analysis. The matched sequence ID was used to search UniGene database to obtain 
the UniGene cluster ID. 

At page 44, line 26, to page 45, linel 7, please substitute the following recitation 
for the existing text: 
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Identify the correct sequence from multiple sequences that matched with the 
same SAGE Tag. When matching SAGE tag sequences in databases, a single SAGE tag may 
align with several sequences. For example, nine out of 40 SAGE tag sequences show matches 
to multiple Unigene Clusters (Zhang et al 9 1997). Other than sharing the same SAGE tag 
sequence, these matched sequences have no homology and are derived from various different 
tissues. To test this issue experimentally, 12 SAGE tags were used for amplification with cDNA 
samples from 24 different human tissues. Four out of these 12 tags generated multiple 
templates. For example, the SAGE tag (GTCATCACCA) (SEP ID NO: 17> generated five 
different sequences from five different tissues (fetal liver, skeletal muscle, spinal cord, trachea 
and colon), and two different sequences from the same tissue (spinal cord) (Table 4). All of 
these fragments contained the same SAGE tag sequence, but the rest of the sequences showed 
no homology. Among these sequences, the ones from colon tissue all matched the previous 
amplified sequences in the colon (Table 3). These data indicate that a SAGE tag itself may not 
be sufficient to serve as a unique identifier for a particular sequence, when several sequences 
share the same SAGE tag sequences. It is important to distinguish which one of the matched 
sequences is the correct sequence corresponding to the particular SAGE tag. To avoid the 
uncertainty when different sequences are expressed from different tissues, it will be necessary to 
generate the fragment from the same tissue used to generate the SAGE tag. The inventors' 
observations also indicate that relying only on a database search to identify the sequence 
corresponding to a SAGE tag may provide misleading information. Direct amplification of the 
specific template with the inventors strategy will be very useful for confirmation of the validity 
of a particular SAGE tag. 

At page 46, lines 1-5, please substitute the following recitation for the existing 

text: 



Table 3. Summary of GLGI results from SAGE Tags 



C A PUT TP„^„ 

bAlxb lags 


T T 

Unigene 


3' end 


Amplified by 


Length 


Match to 


(10 base) 


ID 


nucleotide 


anchored oligo 


of 


original 






in matched 


dT 


sequen 


seauence** 










ce (bs) 




nn a a ppttt a 
OvjAAOCj 1 1 1 A 


TT 1 r\ C A Ci A 

Hs. 105484 


dT/dG 


dT 


77 




t >5lL\J lU 1NU. 16) 










ALjA 1 CUCAACj 


Hs.50813 


dC/dG 


dC 


84 














LI 1A1 LKj 1 


Hs. 179608 


dT 


dT 


86 


+ 


(oJHJ 1JJ JNVJ. ZU) 










A PP A TPPTPP 


Hs.71779 


dC 


dC 


112 


+ 


( ocU ID IN (J. Z 1 ) 










r^TT^ A TP A pp a 

Lr 1 LA 1 LALLA 


Hs. 32966 


dC 


dC 


119 


+ 


toUiJ 1U ZZ) 










/"2 A A TP P 

LiALLALj 1 CjvjC 


Hs. 143131 


dC/dT 


dC 


135 


+ 


loxHv' 1U INL 1 . Z3 ) 










tiui l uu i uA 


Hs.3463 


dC 


dC 


148 




\ uL/V XJL/ 1NU. Z*t J 








At 1 oijrLr ILIA 


TI„ ino'll 

Hs. 227823 


dG 


dG 


150 


+ 


1 jdV ILJ L\yJ. ZD ) 












Hs. 105460 


dC 


dC 


166 


+ 


\ Oi_/V 1U L^iKJ. ZOI 










tuu I LtVjCjACC 


TT— C\C\ 'X 1 C 

Hs. 99175 


dC/dT/dG 


dC 


200 




( oxif 1JJ INL*. Z / ) 










PrTTP AAA Tf 

1 1 LAAA 1 L 


Hs.231 18 


dC/dT 


dC 


220 




loClJ 1U JNU. zo) 








P..P a p p pp r^T^r^ 


Hs. 33455 


dT/dG 


dT 


238 


+ 


(z>bij ID NO: 29) 








A A P- A A P A T A P 

AAuAAUA 1 ACj 


TT Tin jn 

Hs. 73848 


dT 


dT 


317 


+ 


(o-bU ID iNU: 30) 










riATrrrA apt 


rls. 1 lo /oo 


dG/dT/dC 


dG 


329 


+ 


(SEOIDNO: 311 










GAACAGCTCA 


Hs. 194659 


dT 


dG 


382 


+ 


(SEO ID NO: 32) 








AGGTGACTGG 






dC 


156 




(SEO ID NO: 331 










CACCTAGTTG 






dT 


170 




(SEO ID NO: 341 








CCTGTCTGCC 






dT 


249 




(SEO ID NO: 351 









*The y end nucleotides from all the sequences were included in each matched Unigene cluster. 
**The amplified sequences were matched to databases again. The last three sequences have no 
matches and represent novel sequences. 



At page 47, lines 1-2, please substitute the following recitation for the existing 

text: 



Table 4. Detection of heterogeneous sequences in various tissues 
containing the same SAGE Tag 



SAGE TAG 


Pn^itivp Hqqiip<; 


T Tnicrf^Tif* TO 


1 o"fn f\T cpniipnfp 


CGGTGGGACC 


Colon, Thymus, Small intestine 


Hs.99175 


200 


fSEOIDNO: 14) 


Small intestine 


no match 


368 




Thymus 


no match 


90 


AGATCCCAAG 


Colon, Heart, Placenta, Thymus 


Hs.50813 


84 


(oliU ID NO: 13) 


Placenta 


no match 


53 




Skeletal muscle 


Hs.85937 


282 




Testis 


no match 


227 




Thymus, Placenta 


no match 


51 


CTTATGGTCC 


Bone marrow 


Hs.237416 


393 


CSEOIDNO: 16) 


Bone marrow 


no match 


144 




Colon 


Hs. 179608 


86 


GTCATCACCA 


Fetal liver, Spinal cord 


Hs.222346 


125 


(SEOIDNO: 17) 


Skeletal muscle 


Hs.1288 


399 




Spinal cord 


Hs.9641 


394 




Trachea 


no match 


225 




colon 


Hs.32966 


136 



At page 48, lines 16-30, please substitute the following recitation for the existing 

text: 

The same RNA samples from human and mouse myeloid cells used for SAGE 
analysis were used as the templates for GLGI amplification. mRNAs from 5 ixg of total RNA of 
each sample were isolated with Oligo (dT) 2 5 Dynabeads (Dynal), following the manufacturer's 
protocol. Poly(dA/dT) cDNAs were synthesized using a cDNA synthesis kit (Cat. No: 18267- 
021, Life Technologies) and the 5 f biotinylated, 3 f anchored oligo (dT) primers were used for 
first strand cDNA synthesis (5 ! biotin-ATCTAGAGCGGCCGC-T 1 6-A,G ? CA,CG and CC) 
(SEOIDNOS: 1-5) (Wang et al y 2000). The double-strand cDNAs were then digested with 
Nlaffl, and 3* cDNAs were isolated with strep tavi din beads (Dynal), following the manufactures 
protocol. In order to generate enough 3 1 cDNAs for GLGI analysis, 3* cDNA templates were 
amplified by PCR as the following: SAGE linker A or B was ligated to the 3' cDNAs bound to 
the beads (Linker A: 5'- 

TTTGGATTTGCTGGTGCAGTACAACTAGGCTTAATAGGGACATG - 3' (SEP ID NO: 6) 
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and 5'- pTCCCTATTAAGCCTAGTTGTACTGCACCAGCAAATCC [amino mod. C7]- 3' 
(SEP ID NO: 7) : or Linker B: 5'- 

TTTCTGCTCGAATTCAAGCTTCTAACGATGTACGGGGA CATG - 3' (SEP ID NO: 8) and 
5'- pTCCCCGTACATCGTTAGAAGCTTGAATTCGAGCAG [amino mod. C7]- 3') (SEP ID 
NP: 9) (http://ww.oagonot.org/sag o jrotocol.htm) . The ligated 3' cDNAs were then amplified 
by 20 cycles of PCR at 94°C for 30 s, 55°C for 30 s, and 72°C for 30 s, with PLATINUM Taq 
polymerase (Life Technologies), SAGE sense primer (5'- GGATTTGCTGGTGCAG TACA - 3' 
(SEPIDNP: 10) for linker A; or 5'- CTGCTCGAATTCAAGCTTCT - 3' (SEP ID NP: in for 
linker B) ( - http://www.Bag o net.org/sage jrotocol.htm) and antisense primer (5' - 
ACTATCTAGAGCGGCCGCTT- 3') (SEP ID NP: 12) located in the 5' end of anchored oligo 
dT primers used for the first strand cDNA synthesis. The amplified templates were extracted by 
phenol/chloroform, precipitated by ethanol/NH 4 PAc/glycogen, and resuspended in TE buffer for 
GLGI amplification. 

At page 49, line 12, to page, line 29, please substitute the following recitation for 
the existing text: 

The sense primer used for GLGI amplification included 14 bases (CATG + 10 
base SAGE tag sequence) at the 3' end and 6 bases (GGATCC, BamH I sites) at the 5' of the 
primer, giving a total of 20 bases for each primer: 5'- GGATCCCAT GNNNNNNNNNN -3' 
(SEQ ID NP: 13) (Chen et al, 2000). Sense primers were synthesized in 96 well format and the 
concentration was adjusted to 50ng/ul with TE. GLGI master mixtures were prepared for each 
reaction, containing lx PCR buffer (20 mM TrisCl pH 8.4, 50 mM KC1), 2 mM MgCl 2 , 0.2 mM 
dNTPs, 1.5 units / 0.3 \i\ PLATINUM Taq polymerase, 60 ng / 1.2 ^il antisense primer (5'- 
ACTATCTAGAGCGGCCGCTT-3') ( SEP ID NP: 121 and 0.5 - 5 ng of 3' cDNAs. The 
reaction mixtures were aliquoted into a 96-well plate at 28.8 ju.1 per well. Sense primers (60 ng / 
1 .2 ul) were then added into each well. GLGI reactions were performed in PE GeneAmp PCR 
Systems 9600 or 9700. The conditions used were 94°C for 2 min, followed by five cycles at 
94°C for 30 s, 55°C for 30 s, and 72°C for 30 s. The conditions were then changed to 20-25 
cycles at 94°C for 30 s, 60°C for 30 s, and 72°C for 30 s. Reactions were kept at 72°C for 5 min 
for the last cycle. The amplified products were directly precipitated in the 96-well PCR plate by 
adding lOO^il of precipitation mixture to each well, containing 1 of glycogen (20 mg/ml, 
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Roche), 15}al of 7.5M NH 4 OAc and 84|ul of 100% ethanol. The plate was sealed with Tape pads 
(QIAGEN, Inc), vortexed, and kept at room temperature for 15 min. After spinning at 4000 rpm 
for 35 min at 4°C (SORVALL RC5C plus; rotor: SH3000), the supernatants were removed, 
150|al of 70% ethanol were added per well to wash the DNA, and the plate were spun at 4000 
rpm for 15 minutes. The supernatants were removed again, the pallets were air-dried, and 
dissolved in 5|ul of dH 2 0. Two jal of DNA, 0.7 ^1 of salt solution, 0.7 jal of water, and 6 ng of 
pCR4-TOPO vector were used for each ligation reaction with TOPO TA cloning kit for 
sequencing (Invitrogen). The ligation reactions were performed at room temperature for 25 min. 
For transformation, 2 jal of ligation were mixed with 50 fil of TOPO 10 competent cells 
(Invitrogen), kept on ice for 20 min, then heated at 42°C for 30 s, and moved on ice. SOC media 
(250 pi) were added per well. Plate was sealed, shaken at 37°C for 60 min at 225 rpm. The 
transformants were spread on LB plates containing 50 ng/ml of kanamycin and grew over night 
at 37°C. Positive clones were screened by direct colony-PCR. PCR master mixtures were 
prepared, containing lx PCR buffer (10 mM TrisCl pH 8.3, 50 mM KC1, 1.5 mM MgCl 2 ), 0.1 
mM dNTPs, 0.5 units / 0.1 |al Taq polymerase (TaKaRa), 60 ng of sense primer (M13 reverse 
primer) and 60 ng of antisense primer (M13 forward (-20) primer). The reaction mixtures were 
aliquoted into a 96-well plate at 25jal per well, and colonies were picked into the reaction 
mixtures with sterile pipette tips. PCR was performed in PE GeneAmp PCR Systems 9600 or 
9700. The conditions used were 94°C for 2 min, followed by 25 cycles at 94°C for 30 s, 55°C 
for 30 s, and 72°C for 60 s. The reactions were kept at 72°C for 5 min after the last cycle. 75(^1 
of precipitation mixture were added per well to precipitate DNAs, containing 22 jLtl of dH 2 0, 
15(^1 of 2M NaC10 4 and 38 jul of 2-propanol. The plate was sealed, vortexed, and kept at room 
temperature for 5 min. After spinning at 4000 rpm for 35 min at 4°C, the supernatants were 
removed, 150^1 of 70% ethanol were added per well to wash the DNA, and the plate were spun 
at 4000 rpm for 25 minutes. Supernatants were removed again, the pallets were air-dried, and 
dissolved in 10|il of dH 2 0. Sequencing mixtures were prepared in a total volume of 7jil , 
containing 0.8]ul of big-dye pre-mixture, 1.4jal of dilution buffer (400 mM TrisCl pH 9.0, 10 
mM MgCl 2 ), 30 ng / 0.3 jxl of sequence primer (Ml 3 reverse primer or Ml 3 forward (-20) 
primer), l.Sjal H20, and 3^1 of DNA templates. Sequencing reactions were performed at 96°C 
for 10 s, 50°C for 5 s, and 60°C for 4 min for 99 cycles. The final sequencing products were 
precipitated by adding 15\i\ of precipitation mixture, consisting of 64jli1 of 100% ethanol/3M 
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NaOAc mixture (25:1), lul of glycogen (20 mg/ml) and lOul dH 2 0. The plate was sealed, 
vortexed, and kept at room temperature for 15 min. After spinning at 4000 rpm for 35 min at 
4°C, the supernatants were removed, 150^1 of 70% ethanol were added per well to wash the 
DNA, and the plate were spun at 4000 rpm for 15 minutes. The supernatants were removed, the 
pallets were air-dried, and dissolved in 3ul of loading dye. One ul was loaded in 5% sequencing 
gels. Four to six clones were sequenced for higher abundant SAGE tags, and 8 to 12 clones 
were sequenced for low abundant SAGE tags. Sequences were collected with an ABI 377 
sequencer. 

At page 51, line 1 7, to page 52, line 8, please substitute the following recitation 
for the existing text: 

The high-throughput GLGI procedure has several differences as compared to the 
GLGI, for example, (i) 3' cDNAs instead of full-length cDNAs are used as the templates for 
GLGI amplification. This prevents artificial amplification from non-specific annealing of sense 
primer to sequences upstream of the last CATG. The 3 ' cDNAs can be amplified to provide 
sufficient templates for GLGI amplification; (ii) a single antisense primer (5'- 
ACTATCTAGAGCGGCCGCTT-3') (SEP ID NO: 12) is used for all GLGI reactions instead of 
using combination of the five anchored oligo dT primers. The sequence of the antisense primer 
is located in 3' end of all the cDNA templates incorporated from anchored oligo dT primers used 
for the first strand cDNA synthesis. The inventors have observed that the anchored oligo dT 
primers are unstable which can hinder the successful performance of GLGI. Use of the single 
primer also increased the efficiency of GLGI amplification significantly as any annealing of this 
primer with 3' end sequence results in extension during PCR. In contrast, the use of five 
anchored oligo dT primers results in an extension by PCR only when correctly paired primers 
anneal. This feature is particularly useful to amplify the templates with low copies; (iii) 
PLATINUM Taq polymerase instead of Pfu DNA polymerase was used for GLGI amplification, 
in order to increase the yield of final products, while maintaing high specificity; (iv) the GLGI 
amplified DNAs were directly precipitated and cloned into vector without gel purification, to 
prevent the loss of amplified products. This is contemplated be particularly important for 
products with short sizes and for products generated from templates with low copies. The 
inventors data showed that these changes significantly increase efficiency and specificity for 
GLGI amplification of 3' cDNAs, especially for templates expressed at low level. 
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